

FIGURE 6.1
An illustration of BiRe-ID based on Kernel Refining Generative Adversarial Learning (KR-GAL) and Feature Refining Generative Adversarial Learning (FR-GAL). KR-GAL consists of the unbinarized kernel w_i, the corresponding binarized kernel b_{w_i}, and the attention-aware scale factor α_i, which is employed to channel-wise reconstruct the binarized kernel b_{w_i}. We employ a conventional MSE loss and a GAN to fully refine w_i and α_i. FR-GAL is a self-supervision tool that refines the features of the low-level layers with the semantic information contained in the high-level features. To compare the features of the low- and high-level parts, we employ a 1×1 convolution and nearest-neighbor interpolation f(·) to keep the channel dimensions identical. Then the high-level features can be utilized to refine the low-level features through a GAN.

6.2 BiRe-ID: Binary Neural Network for Efficient Person Re-ID

This section proposes a new BNN-based framework for efficient person Re-ID (BiRe-ID) [262]. We introduce kernel and feature refinement based on generative adversarial learning (GAL) [76] to improve the representation capacity of BNNs. Specifically, we exploit GAL to efficiently refine the kernels and features of BNNs. We introduce an attention-aware factor to refine the 1-bit convolution kernel under the GAL framework (KR-GAL). We reconstruct real-valued kernels from their corresponding binarized counterparts and the attention-aware factor. This reconstruction process is supervised by both GAL and an MSE loss, as shown in the upper left corner of Fig. 6.1.
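The kernel reconstruction described above can be sketched in a few lines. This is a minimal NumPy illustration, not the BiRe-ID implementation: here the scale factor α_i is taken as the channel-wise mean of |w_i| (the closed-form minimizer of the MSE term), whereas BiRe-ID learns α_i under GAL supervision, and all tensor shapes are hypothetical.

```python
import numpy as np

def binarize_kernel(w):
    """Binarize a real-valued kernel with the sign function."""
    b_w = np.sign(w)
    b_w[b_w == 0] = 1.0  # map zeros to +1 so every entry lies in {-1, +1}
    return b_w

def channelwise_scale(w):
    """Per-output-channel scale factor alpha: the mean absolute value,
    which minimizes the MSE below in closed form. (In BiRe-ID, alpha
    is instead an attention-aware factor learned under GAL.)"""
    # w has shape (out_channels, in_channels, kH, kW)
    return np.abs(w).mean(axis=(1, 2, 3), keepdims=True)

def reconstruction_mse(w, alpha, b_w):
    """MSE between the real kernel and its scaled binary reconstruction."""
    return float(np.mean((w - alpha * b_w) ** 2))

rng = np.random.default_rng(0)
w = rng.normal(size=(8, 4, 3, 3))        # real-valued kernel w_i (hypothetical shape)
b_w = binarize_kernel(w)                 # binarized kernel b_{w_i}
alpha = channelwise_scale(w)             # channel-wise scale factor alpha_i
loss = reconstruction_mse(w, alpha, b_w) # supervises the reconstruction
print(b_w.shape, alpha.shape)
```

Channel-wise scaling matters because a single binary kernel loses all magnitude information; multiplying each output channel by its own α_i restores the first-order statistics of w_i at negligible extra cost.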

Furthermore, we employ a self-supervision framework to refine the low-level features under the supervision of the high-level features, which carry semantic information. As shown in the upper right corner of Fig. 6.1, we use a feature-refining generative adversarial network (FR-GAL) to supervise the low-level feature maps. In this way, the low-level features are refined by the semantic information contained in the high-level features, which improves the training process and leads to a sufficiently trained BNN.
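The alignment step that makes this comparison possible can be sketched as follows. This is a minimal NumPy sketch under assumed shapes: nearest-neighbor interpolation f(·) matches the spatial sizes, and a 1×1 convolution (a per-pixel linear map over channels) matches the channel counts; the GAN discriminator is replaced by a plain MSE here purely for brevity, and the projection weight `proj` is a hypothetical random initialization.

```python
import numpy as np

def nearest_upsample(x, factor):
    """Nearest-neighbor interpolation f(.) along the spatial axes.
    x has shape (C, H, W)."""
    return x.repeat(factor, axis=1).repeat(factor, axis=2)

def conv1x1(x, weight):
    """1x1 convolution: an independent linear map over channels at each pixel.
    x: (C_in, H, W), weight: (C_out, C_in)."""
    return np.einsum('oc,chw->ohw', weight, x)

rng = np.random.default_rng(1)
high = rng.normal(size=(32, 7, 7))    # high-level feature: more channels, smaller map
low = rng.normal(size=(16, 14, 14))   # low-level feature to be refined
proj = rng.normal(size=(16, 32)) / np.sqrt(32)  # 1x1 conv weight (hypothetical)

# Align the high-level feature to the low-level feature's shape,
# then compare; a discriminator would consume this pair in FR-GAL.
target = conv1x1(nearest_upsample(high, 2), proj)  # shape now matches `low`
mse = float(np.mean((low - target) ** 2))  # stand-in for the adversarial signal
print(target.shape)
```

Once the two feature maps share a shape, any pairwise loss (adversarial or MSE) can push the low-level features toward the semantics of the high-level ones.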

6.2.1 Problem Formulation

We first consider a general quantization problem for deeply accelerating convolution operations to calculate the quantized or discrete weights. We design a quantization process by